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Pattern  matching  in  trees  is  fundamental  to  a  variety  of  programming 
language  systems.  However,  progress  has  been  slow  in  satisfying  a  pressing  need 
for  general  purpose  pattern  matching  algorithms  that  are  efficient  in  both  time  and 
space.  We  offer  asymptotic  improvements  in  both  time  and  space  to  Chase’s 
bottom-up  algorithm  for  pattern  preprocessing.  Our  preprocessing  algorithm  has 
the  additional  advantage  of  being  incremental  with  respect  to  pattern  additions  and 
deletions.  We  show  how  to  modify  our  algorithm  using  a  new  decomposition 
method  to  obtain  a  spaceAime  tradeoff.  Finally,  we  trade  a  log  factor  in  time  for  a 
linear  space  bottom-up  pattern  matching  algorithm  that  handles  a  wide  subclass  of 
Hoffmann  and  O’Donnell’s  Simple  Patterns,  z' 

(  -'r  — 

1.  Introduction 

Pattern  Matching  in  trees  is  fundamental  to  term  rewriting  systems  [11],  transformational 
programming  systems  [4,7,18,22],  program  editing  and  development  systems  [6,13],  code  gen¬ 
erator  generators  [9,17],  theorem  provers[14],  logic  programming  optimizers  that  attempt  to 
replace  unification  with  matching[16],  and  compilers  for  ML[21],  Haskell[12],  and  a  variety  of 
functional  languages  with  equational  function  definitions.  However,  this  problem  seems  to  be 
extremely  difficult.  The  best  known  space-efficient  top-down  algorithm  to  locate  all  occurrences 
of  a  pattern  tree  of  size  /  in  a  tree  of  size  n  takes  O(nl15pofylog(l))  time,  a  recent  result  due  to 
Kosaraju  [15],  which  is  barely  better  than  the  naive  O(nl)  algorithm.  Bottom-up  pattern  matching 
seems  to  be  even  more  difficult  than  top-down  matching  and  is  of  special  practical  importance.  In 
a  seminal  paper  Hoffmann  and  O’Donnell  presented  bottom -up  tree  pattern  matching  algorithms 
that  were  highly  efficient  in  time  but  required  excessive  space  [10]  both  in  theory  and  practice  (see 
Chase’s  empirical  data  [5]).  Hoffmann  and  O’Donnell's  work  has  stimulated  a  number  of  papers 
offering  heuristic  space  improvements  [2,3,5, 19],  and  Chase’s  method  has  aroused  considerable 
attention  [5].  However,  none  of  these  papers  offer  theoretical  improvements  or  promising 
space/time  tradeoffs. 
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In  this  paper  we  present  two  new  theoretical  results  in  bottom-up  tree  matching. 

1.  At  the  end  of  his  CAAP  ’88  paper  [3]  Burghardt  called  for  an  algorithm  that  could  prepro¬ 
cess  pattern  trees  incrementally  as  worthwhile  future  research.  Such  an  algorithm  is  needed  in  the 
RAPTS  transformational  programming  system  [4],  because  incrementally  modifying  systems  of 
rewrite  rules  is  a  frequent  activity,  and  preprocessing  full  sets  of  patterns  is  highly  expensive. 

In  this  paper  we  present  a  modification  to  Chase’s  algorithm  so  that  its  costliest  task,  prepro¬ 
cessing,  can  be  achieved  incrementally  with  respect  to  additions  and  deletions  of  patterns.  When 
our  algorithm  is  applied  repeatedly  to  solve  nonincremental  preprocessing  by  adding  one  pattern  at 
a  time  starting  from  the  empty  set,  it  runs  asymptotically  better  in  time  and  space  than  Chase’s 
algorithm. 

2.  In  bottom-up  pattern  matching,  the  main  difficulty  that  sorely  needs  to  be  overcome  is 
space  utilization.  We  present  an  algorithm  for  a  subclass  of  Hoffmann  and  O’Donnell’s  Simple 
Patterns  that  runs  in  0(1)  space  overall  and  OQog  I)  time  per  step,  where  /  is  a  parameter  related  to 
the  number  of  input  patterns.  Previous  bounds  due  to  Hoffmann  and  O’Donnell  are  0(l2)  time  and 
space  for  an  algorithm  tailored  to  binary  Simple  Patterns  (which  our  subclass  properly  includes) 
and  0(lkmax+l)  space  with  0(1)  step  time  for  an  algorithm  handling  all  Simple  Patterns,  where 
kmax  is  the  maximum  arity  of  a  pattern.  Thus,  we  offer  a  quadratic  space  improvement  over  the 
latter  algorithm  for  binary  patterns  and  even  more  dramatic  improvement  for  patterns  of  greater 
arity.  Our  space  compression  is  obtained  by  applying  persistent  data  structures  in  a  new  way. 

2.  Bottom-up  pattern  matching 

Hoffmann  and  O’Donnell  [10]  define  patterns  inductively  as  follows: 

Definition:  Given  an  alphabet  of  one  distinguished  variable  v  and  function  symbols  /  with 
fixed  arity  A(f),  then  the  set  of  patterns  is  the  smallest  set  of  terms  that  include  (i)  v,  (ii)  constant  c 
if  c  is  a  function  symbol  with  arity  0,  and  (iii)  f(p  j ,  •  •  • ,  p*),  which  we  call  an  / -pattern,  if /is  a 
function  symbol  of  arity  k  andpi,  •  •  • ,  p*  are  patterns. 

They  also  define  pattern  matching  as  follows: 

Definition:  Pattern  p\  is  said  to  be  more  general  than  pattern  p2,  denoted  by  pi  <tp2,  iff 
either(i)  pi  isv,or(ii)p]  is/(xi,  •  •  • ,  xk),p2  is/(yi,  •  •  •  ,yk)  andx,  £y,  for  i  =  1,  •••,*. 

If  p i  2: P2,  we  also  say  that  p  i  matches p2  or  that  [pi,p2]  is  a  match.  The  set  of  subexpres¬ 
sions,  or  subpattems,  of  p  is  denoted  by  sub(p).  A  slightly  more  general  form  of  the  main  pattern 
matching  problem  considered  by  Hoffmann  and  O’Donnell  is: 

Problem(Multi-pattem  matching):  Given  a  set  P  of  patterns  and  a  pattern  t  called  the  subject, 
find  the  set  MPTM(t) -  {[p,  q\. p  e  P,  q  e  sub(t)  I  p  2  q)  of  all  patterns  in  P  matching  subpat¬ 
tems  of  t. 

As  the  preceding  notation  suggests,  bottom-up  solutions  presented  by  Hoffmann  and 
O’Donnell  and  Chase  treat  the  set  P  of  patterns  as  fixed  and  the  subject  t  (which  for  them  has  no 
variables)  as  the  only  parameter  that  can  vary.  In  a  bottom-up  strategy  to  solve  the  multi-pattern 
matching  problem,  a  complete  set  MPTM  (q)  of  matches  is  found  for  each  subpattem  q  of  t  without 
reference  to  any  subpattem  of  t  that  properly  encloses  q. 

In  order  to  explain  these  algorithms,  we  need  to  first  present  a  few  definitions  and  notational 
conventions. 

Definition:  If  P  is  a  set  of  patterns,  then  the  pattern  forest  PF  of  P  is  the  set  of  subpattems  of 
all  the  patterns  in  P. 
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Definition:  If  PF  is  the  pattern  forest  for  a  set  P  of  patterns  and  t  is  the  subject,  then  the 
match  set  MS  (f)  for  t  is  defined  by  the  role  MS  it)  =  [q  e  PF  I  q  £  r) . 

The  main  idea  underlying  Hoffmann  and  O'Donnell’s  bottom-up  algorithm  is  the  following 
equivalent  recursive  definition: 

MS(y)  =  {v} 

MSic)  =  {v},  when  constant  c  4  PF 
(v,c),  when  constant  c  e  PF 

(1)  MS (/■(*!•  •".**))=  {/(*i.  •  •  • .  q*)e  PF  I  qt  €  MS(r,),»  =  1,  •  •  • ,  k)  u  {v} 

After  determining  match  sets  for  constants  and  variable  occurring  in  the  subject  t,  the  main  task  of 
Hoffmann  and  O’Donnell’s  bottom-up  algorithm  is  to  identify  the  match  set  for  each  subpanem 
fit  j,  •••,**)  of  t  based  on  the  match  sets  for  r,,  i  =  1,  •  •  • ,  k.  This  is  achieved  by  solving 
expression  (1),  which  we  call  the  Basic  Bottom-Up  Step. 

Consider  a  multi-pattern  matching  problem  instance  with  pattern  set  P,  pattern  forest  PF,  and 
subject  t.  We  will  use  the  following  parameters  throughout  this  paper 
n  =  length  of  t 

m  =  number  of  match  sets  for  P 
l  =  \PF  I 
o-  \MPTM{t)\ 

kmax  -  maximum  arity  of  any  function  appearing  in  PF 

In  order  to  compute  Step  (1)  and  prim  the  set  MS(f(ti,  *  •  * ,  tk))nP  of  patterns  that  match 
fit  j,  •••,/*)  in  time  0(k+  \MS(f(t\,  ••• ,  tk))  n  P  I),  Hoffmann  and  O’Donnell  preprocess 
the  patterns  in  P  to 

i.  encode  each  pattern  in  PF  as  a  distinct  integer  from  1  to  /,  and  represent  patterns  as 
trees  in  the  obvious  way  (implemented  in  compressed  form  as  dags); 

ii.  compute  all  match  sets,  and  encode  each  such  set  as  a  distinct  integer  from  1  to  m; 

iii.  compute  the  subset  of  patterns  in  P  belonging  to  the  i'h  match  set  for  i  =  1 m; 

iv.  compute  a  transition  function  Xj  for  every  k-ary  function  symbol  /  occurring  in  P  so 
that  x jims\,  •  •  • ,  msk)  -  MSif  (fj,  •  •  • ,  tk)  whenever  msj  =  MS(tj)  for ;  =  1 ....  t,  xv 
=  {v},  and  xe  =  {v,  c }  if  c  is  any  constant  appearing  in  PF;  transition  maps  Xj  are 
implemented  as  multi-dimensional  arrays  accessed  using  integer  encodings  of  match 
sets. 

After  preprocessing  the  patterns  in  P ,  Hoffmann  and  O’Donnell’s  algorithm  solves  the 
multi-pattem  matching  problem  by  repeatedly  solving  Step  (1)  from  innermost  to  outermost  sub- 
pattern  of  t.  Their  worst  case  time  is  0(n  +  o)  after  preprocessing  P.  The  transition  table  x j  for 
each  k-ary  function  symbol  f  appearing  in  PF  uses  space,  where  the  number  m  of  match 
sets  can  be  £2(2*),  which  is  expensive  in  practice.  Their  rough  bound  on  preprocessing  time  is 
0(/2m*m“). 

Several  approaches  seem  reasonable  to  overcome  the  large  preprocessing  and  space  costs. 
Chase  [5]  saves  space  in  the  transition  function  by  eliminating  some  redundancy.  Hoffmann  and 
O’Donnell  restrict  the  class  of  patterns  to  the  Simple  Patterns  for  which  m  is  always  small  -  essen¬ 
tially  Oil).  For  Simple  Patterns  that  are  further  restricted  to  have  arity  less  than  or  equal  to  two. 
Hoffmann  and  O'Donnell  give  an  algorithm  where  match  sets  can  be  avoided  entirely. 

In  the  next  section  we  show  how  to  make  Chase’s  preprocessing  algorithm  incremental  and 
asymptotically  better  in  both  time  and  space.  We  also  present  a  general  problem  decomposition 
technique  that  allows  the  algorithm  to  be  tailored  according  to  a  spaceAime  tradeoff.  After  that,  we 
show  how  to  improve  the  space  and  time  for  Hoffmann  and  O’Donnell’s  algorithm  for  binary 
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Simple  Patterns  and  how  to  extend  the  algorithm  to  a  wide  subclass  of  Simple  Patterns  with  unres¬ 
tricted  arity. 

3.  Incremental  preprocessing 

Chase  was  able  to  improve  Hoffmann  and  O’Donnell’s  method  by  exploiting  the  deeper 
structure  of  the  pattern  set  P  to  reduce  the  size  of  transition  functions[5].  Chase’s  heuristic 
preserves  the  0(1)  per  step  matching  time. 

Let  PF  be  the  pattern  forest  for  P,  and  assume  that  it  contains  variable  v.  For  each  function  / 
appearing  in  PF,  define  projection  11/  =  {c4: f(c\,---  ,c*)  e  PF)  to  be  the  set  of  patterns  appear¬ 
ing  as  the  i‘h  parameter  of  some  /-pattern  in  PF.  Chase  made  the  crucial  observation  that  the  basic 
Bottom-Up  Step  (1)  could  be  rewritten  equivalently  as 

(2)  MS(f  (fj,  •  •  • .  tk))  =  (f(cx,  •  •  •  ,ck)  €  PF  I  a  e  MSfa)  n  n>.  i  «  1 ..  *}  u  {v} 

and  that  the  size  of  the  finite  function  (i.e.,  number  of  pairs  stored  in  its  graph  representation) 
defined  by  the  rule  Bf(MS(t\)  n  II},  ••• ,  MS (tk)  n  T\j)=MS(f  (r  j.  •  •  • ,  tk))  must  be  no  greater 
than  Hoffmann  and  O’Donnell’s  transition  function  Xj.  The  essential  idea  may  be  simply  put:  for 
any  two  finite  functions  /and  g  where /is  defined  by  the  rule  f(h(x))  =  g(x),  we  know  that  1/  <  Igl 
as  long  as  A  is  not  one-to-one.  Chase  also  provided  extensive  empirical  evidence  to  show  that  0/  is 
much  smaller  than  Xj  in  practice. 

Chase’s  bottom-up  step  involves  two  substeps.  First  each  Hoffmann  and  O’Donnell  match 
set  MSifi)  is  turned  into  the  smaller  Chase  match  set  p}(AfS (r;))= MS (r,)n  11/  for  i  =  1 k. 
Next,  Chase’s  transition  function  8/  is  used  to  obtain  the  Hoffmann  and  O’Donnell  match  set 
0/Qi}(MS(ti)), ...,  (i/(MS (/*))).  Chase’s  implementation  uses  integer  encodings  for  both  kinds  of 
match  sets. 

We  will  give  an  abstract  algorithm  that  incrementally  constructs  functions  p  and  6  and  runs 
asymptotically  faster  than  Chase’s  algorithm.  Since  our  algorithm  is  specified  in  terms  of  set  and 
map  operations,  it  is  useful  to  discuss  some  notations  and  implementation  details.  In  addition  to 
standard  mathematical  notations  it  will  sometimes  be  convenient  to  use  certain  unconventional  dic¬ 
tions.  Expression  A  with  x  abbreviates  set  element  addition  A  u  (x)  (where  in  this  context  A  is 
interpreted  as  the  empty  set  if  it  is  undefined),  and  assignment  A  op  :=  x  abbreviates  A  :- A  op  x. 
If /is  a  binary  relation,  then  domain/=  {r.  (x,y]  e  /},  range /=  {y:  [x,y]  e  f),f(x)  denotes  func¬ 
tion  application  (undefined  if  /  is  multivalued  at  x  or  if  x  4  domain  /),  and  /{x}  denotes  mul¬ 
tivalued  map  application  with  value  {y:  [x,y]  e  f). 

By  a  Set  Encoding  Structure  (abbr.  SE  Structure)  we  mean  a  triple  ( U ,  A,  Q )  with  finite 
universe  U,  primary  set  A  c  2U,  and  secondary  scIQqU.  SE-structures  support  the  following 
five  operations: 

1.  ( create )  Add  a  new  set  {z}  to  A,  and  possibly  add  z  to  Q,  where  z  e  U,  i.e., 

A  with:-  (*) 

Q  with:- 1 

2.  ( replace )  Replace  a  €  A  by  new  set  a  with  z,  which  is  denoted  by, 

a  with:- 1 

3.  (add)  Add  new  set  a  with  z  to  A,  and  perhaps  add  z  to  Q,  where  a  €  A  and  z  c  U\  that  is, 

A  with:-  a  with  t 

4.  (query)  Retrieve  set  a  n  Q,  where  a  e  A. 
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5.  (index)  Retrieve  set  [a  e  A I c  e  a},  where  c  e  U. 

We  will  implement  SE-stmctures  using  the  following  data  structure  called  an  SE-Tree  (see 
Fig.  1).  Each  set  ax  belonging  to  primary  set  A  is  associated  with  a  unique  node  x  in  the  SE-tree; 
that  is,  x  ’encodes’  ax.  Each  node  x  in  the  tree  will  be  uniquely  associated  with  a  set  ax  c  U,  which 
may  or  may  not  belong  to  A.  If  ax  and  Oy  are  sets  associated  with  tree  nodes  x  and  y,  then  x  is  a 
descendent  of  y  in  the  tree  only  if  Oy  c  ax.  Each  node  is  implemented  by  a  record  with  five  fields: 
a  right  sybling  pointer,  a  leftmost  child  pointer,  a  (2-list  pointer  to  a  subset  of  Q,  a  pointer  to  the 
nearest  ancestor  with  a  nonempty  Q-list,  and  a  membership  bit  indicating  whether  the  node 
corresponds  to  a  set  belonging  to  A  or  not  For  each  node  x,  which  represents  set  ax,  (2-lists  for 
nodes  along  the  path  from  x  to  the  root  are  mutually  disjoint,  and  their  union  stores  the  set  ax  n  Q. 
Sets  U  and  Q  are  implemented  by  a  list  of  records.  The  record  corresponding  to  each  element 
c  €  U  has  a  bit  indicating  membership  in  Q  and  a  pointer  to  a  list  (called  the  c-list)  of  tree  nodes  x 
closest  to  the  root  such  that  the  associated  set  ax  contains  c. 


A-  right  left  Q.\ist  Q-  numeric 

member  sibling  child  ancestor  list  code  U  Q  (/-list 


The  create  operation  (A  with.-  (z);  Q  with:=  z)  is  implemented  by  adding  a  new  tree  root 
with  empty  sybling,  child,  and  Q-list  ancestor  pointers,  membership  bit  on,  and  Q-list  containing  z 
if  it  belongs  to  Q  and  empty  if  not.  We  also  add  a  pointer  to  the  newly  created  record  in  the  z-list. 
This  operation  takes  0(1)  time. 

Implementation  of  replace  a  with:=  z  requires  two  cases  to  be  considered.  In  the  first  case, 
called  a  nondestructive  replace ,  the  tree  node  x  associated  with  a  has  a  nonnull  child  pointer.  Then 
(i)  unset  the  membership  bit  in  x  and  create  a  new  tree  node  y  as  a  child  of  x,  (ii)  if  the  Q-list  in  x  is 
nonempty,  then  make  the  Q-list  ancestor  in  y  point  to  r,  otherwise,  make  it  point  to  the  same  record 
that  the  Q-list  ancestor  in  x  points  to,  and  (iii)  set  the  member  bit  in  y.  In  the  second  case,  where  x 
has  no  children,  we  reuse  x  to  represent  the  new  set  a  with  z.  In  this  case,  called  a 
destructive  replace,  we  assume  that  nodes  x  and  y  are  the  same.  In  either  case,  if  z  belongs  to  Q, 
add  z  to  the  Q-list  for  y.  Finally,  add  y  to  the  z-list.  This  operation  takes  0(1)  time. 

To  implement  add  A  with:=  a  with  z  we  let  x  be  the  tree  node  associated  with  set  a.  Create  a 
new  tree  node  y  associated  with  set  a  with  z,  and  make  y  a  child  of  x.  If  the  Q-list  in  x  is 
nonempty,  then  make  the  Q-list  ancestor  in  y  point  to  x;  otherwise,  make  it  point  to  the  same  record 
that  the  Q-list  ancestor  in  x  points  to,  and  set  the  member  bit  in  y.  If  z  belongs  to  Q,  add  z  to  the  Q- 
listfory.  Finally,  add  y  to  the  z-list.  This  operation  takes  Q(l)  time. 

Operation  query  a  n  Q  is  implemented  as  follows.  If  x  is  the  tree  node  associated  with  a, 
then  retrieve  the  elements  in  each  Q-list  along  the  path  starting  from  x  following  Q-list  ancestors. 
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The  G-lists  along  this  path  are  disjoint.  This  operation  takes  0(la  n  Q\)  time. 

Finally,  SE-trees  support  a  straightforward  implementation  of  index  [a  e  A  I  c  e  a).  Form  a 
list  of  records  x,  where  set  ax  belongs  to  A,  occurring  in  subtrees  rooted  in  nodes  contained  in  the 
c-list.  This  operation  takes  0(1  {a  6  A  Ic  e  a}l)  time,  because  the  number  of  nodes  x  in  these  sub¬ 
trees  such  that  ax  4  A  must  be  strictly  less  than  the  number  of  leaves  in  these  subtrees  (and  all 
leaves  represent  sets  belonging  to  A). 

In  order  to  analyze  the  complexity  of  SE-trees,  we  give  the  following  definitions.  For  each 
node  x  in  an  SE-tree,  define  path  (x)  to  be  the  set  of  nodes  in  the  tree  path  from  the  root  to  x.  Define 
weight  (x)  to  be  the  number  of  elements  u  e  U  whose  u-list  contains  x.  Define 
W„ (A)  =  £  weight  (x)  to  be  the  total  weight  of  all  the  nodes  in  the  tree  that  implements  set 

x  it  a  tret  node 

A.  Letting  des(x)  denote  the  number  of  tree  descendents  of  x,  we  can  define 
Wp(A)  =  2  des(x)xweight(x)  to  be  the  sum  of  the  weights  of  every  tree  path.  Clearly, 

x  it  a  tret  node 

I A  I  £W„(A)  <Wp(A)  <  2  £  la  I.  Usually,  Wn(A)  is  much  smaller  than  WP(A). 

•  C  A 

The  total  space  required  by  an  SE-tree  is  O  (W„(A)),  and  any  sequence  of  h  of  the  first  three 
operations  above  takes  0(h)  time  and  space.  Note  that  a  naive  representation  of  the  set  A  will  take 
0(Wp(A))  space. 

We  will  consider  useful  variants  of  SE-structures  that  require  minor  alteration  to  the  preced¬ 
ing  implementation  and  do  not  affect  the  stated  complexities.  A  Simple  SE-structure  is  one  with  no 
secondary  set.  A  numeric  SE-structure  is  one  in  which  the  set  elements  of  the  primary  set  A  are 

identified  by  natural  numbers  1 . IAI  (cf  Fig.  1).  Numeric  SE-stnictures  have  special  importance  in 

connection  with  our  second  abstract  datatype  described  next 

The  main  abstract  datatype  used  in  our  pattern  matching  algorithm  is  the  SE-Map,  which  is  a 
partial  function  /:  A-*B  from  a  domain  set  A  to  a  range  set  B,  where  A  and  B  are  the  primary  sets 
of  two  SE-stnictures.  SE-maps  support  the  following  two  map  operations: 

1.  (modify  range)  Given  a  set  A  and  an  element  z,  where  Ac  A,  and  z  does  not  belong  to 
any  set  in  B,  add  z  to  /  (x)  for  each  r  belonging  to  A.  This  operation  is  denoted  by, 

(for  x  e  A) 
f(x)  wttb:=  i 

end 

2.  (modify  domain)  Given  a  set  x  in  the  domain  of /and  an  element  z,  form  a  new  domain 
set  x  with  z  and  map  it  under /to  the  old  image  /(x).  This  operation  is  denoted  by, 

/xwtth  i)  :=/(*) 

Our  basic  implementation  of  SE-maps  /  A— uses  SE-tree  implementations  for  A  and  B  as 
described  above.  In  addition,  for  each  pair  [a  ,b]  belonging  to  map  /,  if  x  and  y  are  the  records 
associated  with  sets  a  and  b,  then  x  stores  a  pointer  to  y,  and  y  stores  the  size  of  the  preimage  set 
fx  { b }.  If  A  is  part  of  a  numeric  SE-structure,  it  is  sometimes  useful  to  implement  domain  /as  an 
array  accessed  using  the  numeric  code  of  an  A  element  as  shown  in  Fig.2.  We  also  make  use  of  a 
multi -dimensional  SE-map  in  which  the  domain  is  the  cartesian  product  of  primary  sets  of  SE- 
structures. 

To  implement  modify  range,  (for  x  e  A)  f(x)  with:=  z  end,  we  search  through  records  associ¬ 
ated  with  elements  of  A,  and  handle  these  elements  according  to  three  different  cases.  (1)  If  there 
are  elements  of  A  not  belonging  to  the  domain  of/,  we  augment  B  with  a  new  set  {z)  using  a  create 
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operation  and  add  the  appropriate  record  pointers  and  counts. 


domain  / 


preimage 

range/  count  fi-fields 


array 

/  is  numeric  code  forxe  A 


Fig.2.  S£-map/  A  -» B 

(2)  For  each  range  element  y  e  fi A]  whose  preimage  is  entirely  contained  in  A,  we  simply  add  z 
destructively  to  y  using  a  replace  operation.  Nothing  mors  is  necessary,  since  B  is  modified  impli¬ 
citly.  (3)  For  each  element  y  e  f[ A]  not  handled  in  case  (2),  we  execute  an  add  operation  B  with:= 
y  with  z,  relink  each  element  in  A  C\f*  {y}  to  the  new  set  y  with  z,  and  modify  preimage  counts. 
The  total  cost  of  this  operation  is  0(IAI). 

The  implementation  of  modify  domain  fix  with  z)  :=  fix)  depends  on  whether  or  not  set  x  is 
modified  destructively  using  a  replace  operation  to  obtain  the  new  set  x  with  z.  If  x  is  modified 
destructively,  then  the  implementation  is  vacuous,  since  all  operations  including  the  modification 
to  A  are  implicit.  However,  if  x  is  modified  nondestructive^,  then  we  need  to  link  the  new  domain 
element  x  with  z  to  the  old  range  element  fix)  and  increment  the  preimage  reference  count,  which 
takes  0(1)  time. 

By  an  easy  counting  argument  using  the  preceding  analysis,  we  obtain  the  following  result, 
which  is  central  to  the  analysis  of  our  incremental  preprocessing  algorithm. 

Lemma  1.  Any  sequence  of  intermixed  modify  range  and  modify  domain  operations  takes 
O  (Wii(A)+W'(1(£))  dme  and  space,  where  A  and  B  arc  at  their  final  values. 

Let  F  be  the  set  of  function  symbols  appearing  in  PF.  For  each  function  fe  F,  let  A  (f)  be  its 
arity.  Let  T  be  the  set  of  Hoffmann  and  O’Donnell  match  sets.  From  the  above  discussion,  we 
know  that  the  following  equations  hold: 

r={{v,  s):se  PF  I  s  is  a  leaf }  uu  /{range  Off  e  F  I  A(f)  >  0} 
n>=  {e,-:/(ci,  ***,c*)€  PF) 
p /-  {[m,  m  n  FI)]:  m  e  T} 

0/=  {[[mi,.  ...m*],  mj:  mj  e  range  pj,  •••,*!*€  range  p* ) 
where  m  =  [f(c\,  •  •  •  ,ck)  e  PF  lc,e  miti=l, ...,k)  u  {v} 

Because  the  preceding  equations  contain  a  cyclic  dependency  in  which  T  depends  on  both  PF  and 
6,  p  depends  on  T,  and  0  depends  on  p  and  PF,  it  would  seem  that  a  costly  fixed  point  iteration  is 
needed  to  maintain  these  equations  when  PF  is  modified.  Fortunately,  this  can  be  avoided  with 
careful  scheduling. 

The  algorithm  also  depends  on  a  careful  logical  oiganization  of  the  data  into  SE-stnictures 

and  SE-maps.  Let  C)  represent  Chase  match  sets  for  /  €  F  and  r'=l . A(J).  Then  (PF,  T,  P)  is  a 

numeric  SE-structure  and  (PF,  C), .)  are  Simple  numeric  SE-structures  fotfe  F  and  i=l . A(f). 

We  also  have  the  following  SE-maps:  p/:  T -*Cf  for  feF,  i =],..., A(f),  and  0/: 
C}x...xC/V)  -*r  for  feF.  Fig.3  describes  the  data  structures  used  to  access  the  main  SE - 
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structures 

and  S£-maps  shown  in  Fig.4. 


Fig.3.  Core  data  structure 

It  is  useful  to  explain  our  incremental  algorithm  in  terms  of  three  cases. 

(case  1)  Assume,  first  of  all,  that  the  set  of  patterns  P  is  set  to  empty.  It  is  also  convenient  to 
assume  that  pattern  forest  PF  always  contains  v.  Then  we  can  initialize  variables  T,  II,  |x,  and  0  as 
follows: 


PF:«(v) 

r:={{v» 

n:=u 

H:«{) 
e. :«  (v) 

Next,  suppose  that  P  is  augmented  by  a  new  pattern  p.  In  order  to  reestablish  PF,  we  add  to 
PF  those  subpattems  of  p  not  already  in  PF  in  an  innermost-to-outermost  order.  Because  of  the 
order  in  which  updates  are  scheduled,  we  know  that  immediately  before  a  subpattem  q  of  p  is 
added  to  PF,  either  q  is  a  leaf  or  all  the  subpattems  of  q  except  for  q  itself  are  already  in  PF.  More 
importantly  we  know  that  q  is  not  the  subpattem  of  any  other  pattern  belonging  to  PF. 

(case  2)  Suppose  PF  is  augmented  with  a  constant  symbol  c.  In  this  case,  we  can  maintain  the  sys¬ 
tem  of  equations  by  executing  the  following  code  just  before  the  modification  PF  with:=  c: 

F with:*  (v)  withe 

COMMENT:  Perform  a  modify  domain  operation  on 
(for  [/',/, m]c  p((v})) 
pj((v}wlth  c) nj({v)) 

end 

et (v,c) 

To  implement  the  loop  efficiently,  for  each  match  set  m  €  T  we  maintain  a  single  doubly  linked 
list  threading  each  occurrence  of  m  within  domain  p}  for/ €  F,/*l A  (f). 
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(case  3)  The  third  and  more  difficult  case  to  consider  is  when  PF  is  augmented  with  pattern 
,/*),  where  *>0.  Below  we  describe  how  to  propagate  modifications  to  each  of  the  vari¬ 
ables  T,  n,  p.,  and  0  separately.  Recall  that  each  of  the  sets  T  and  range  p),  / e  F,  i  =  \.A  (f), 
will  be  implemented  as  SE-trees. 


Fig.4.  Data  structure  for  0/  and  p}- 


1.  Modify  Uj  before  the  modification  FF  with:=/(fi,  •  •  •  ,tk): 

(for  j 

\Itj4  TJj  then 
Tlj  wlth:=  lj 
end 
end 

2.  Perform  a  modify  range  operation  on  p^  immediately  prior  to  the  modification  Tlj 
with:=  tj  of  step  1: 

(forme  I~  1 1,- €  m) 

Uj(m)wtth:=»; 

end 

As  discussed  in  SE-tree  operation  5,  we  can  use  the  tj-list  to  retrieve  the  set 
{m  e  r  I  tj  €  m }. 

3.  Perform  a  modify  domain  operation  on  0/  prior  to  the  modification  with:=  r;  of 
step  2  if  the  modify  range  in  step  2  was  nondestructive: 

(for[m,,  domain  8/ 1  m;  =  nj(m) ) 

e/mj,  •••.mywfth/y,  •••,«;,  •••.«%) 

end 

Here0//nj,  •  •  • ,  with  f;,  •  •  • ,  m*)  =  0/(mi,  -'-.my,  m*),  because  the  pattern 

/(fj,  •  •  • ,  f4)  has  not  yet  been  added  to  PF,  and  so  no  /-pattern  in  FF  has  t}  as  its 
child.  We  can  speedup  the  search  by  using  the  index 

[mi . mk]  e  Bf).  Maintaining  all  k  such  indexes  for/along  with  0/  does  not  change 

the  overall  asymptotic  time  or  space. 
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4.  Perform  a  modify  range  operation  on  6/  just  before  the  modification  PF 
with:=/  (r i ,  •  •  •  ,tk)  and  after  the  preceding  three  steps: 

(form,  €  range  \i) . m*  e  range  uj  I  (,  e  . . !»€/«*) 

if  [mi . m,]  4  domain  8/ then 

0/m,,  {v} 

end 

e/m, . m*)with-/(<„  •••.»*) 

end 

It  is  important  to  observe  that  range  p}  is  nonempty  for  i=l,...,fc  because  of  steps  1,  2, 
and  3.  Again,  we  can  use  the  tj-list  to  search  through  the  sets  [mj  e  range  p/r,  e  my) 
instead  of  the  potentially  much  larger  sets  range  fij  ,j=l,..,k.  However,  this  step  contains 
a  new  operation  to  create  a  i -tuple  [mi,...,  m*]  and  locate  it  in  the  domain  of  8/.  Hash¬ 
ing  is  a  practical  solution  that  preserves  the  Lemma  1  space  complexity  but  makes  the 
time  randomized.  This  would  also  make  the  Bottom-Up  Step  0(1)  randomized  time. 
Our  current  implementation  uses  this  approach.  Another  way  of  preserving  space  com¬ 
plexity  at  the  expense  of  time  is  to  use  a  balanced  search  tree;  e.g.,  a  red/black  tree  [23]. 
Access  time  is  then  0(log  Idomain  0/),  and  so  is  the  Bottom-Up  Step.  Like  Chase  we 
can  also  use  a  large  table  to  store  0/,  which  doubles  its  size  and  reorganizes  whenever  it 
overflows.  If  each  new  array  is  allocated  in  unit  time  using  the  solution  to  exercise  2.12 
of  Aho,  Hopcroft,  and  Ullman’s  bookfl],  then  the  Lemma  1  time  complexity  is  preserved, 
but  the  run-time  space  requirements  for  Qf  are  increased  to  be  the  same  as  Chase. 

5.  Modify  T  prior  to  the  modification  8/mi . mt)  with:=/  (rlt  •  •  •  ,f*)  of  step  4  if  the 

modify  range  operation  of  step  4  was  nondestructive: 

rwfth^e/mj.  •••, #*) with /(r„  •,/*) 

Since  / (rj  ,••  •,/*)  is  a  new  subpattem,  no  other  subpattem  in  PF  has  / (ri ,  •  •  •  ,r*)  as  a 
subpattem.  Thus  no  further  modification  is  needed  for  n. 

6.  Perform  a  modify  domain  operation  on  p  just  before  the  modification  I~ 
with:=  0/m  i ,  •  *  * ,  m*)  with  /(f  i ,  •  *  •  ,r*)  of  step  5: 

(for  (/.*.«!  €  Ji{8/m ] ,  ■ 

end 

Observe  that  within  the  preceding  code  p/0/m  i,  •  •  • ,  mk)  with  f(t\,  •  ■  •  ,tk))  = 
p^(0/m  i,  •  •  • ,  mt)),  because  /  (r  i ,  *  •  •  ,t*)  4  .  The  implementation  is  the  same  as  in 

case  2. 

Now  we  compare  the  time  and  space  complexity  of  Chase’s  algorithm  and  our  algorithm  . 
THEOREM  1. 

1 .  For  each  m  €  domain  pf  where /  e  F,j=\.A(f),  Otase’s  algorithm  computes  p/m)  in 
Q(min(  I  m  I ,  III/))  time,  whereas  our  algorithm  takes  O  ( I  p/m)  I )  time. 

2.  For  each  [mi,  •  •  • .  m*]  in  domain  0/,  Chase’s  algorithm  computes  0/mj,  •  •  • ,  mk)  in 
12(min(iFF  l.lmi  x  •••  x  m*  I ))  time,  whereas  our  algorithm  takes  O  ( 1 0/m  i ,  •  •  • ,  m*)l)  time. 

3.  We  use  0(W„(T))  auxiliary  space  to  represent  the  set  T,  whereas  Chase  uses  OCH^fT)) 
space. 

4.  To  represent  the  range  of  p/  we  use  0(W„(range  p/)  auxiliary  space,  whereas  Chase 
uses  OCW/range  p/)  space. 
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Proof  Sketch)  In  both  algorithms,  the  time  complexity  is  dominated  by  the  time  needed  to 
construct  the  tables  pi}  and  8/,  where /e  F  and  j  =1  .A  (f). 

1.  For  each  m  e  domain  \ij.  Chase’s  algorithm  computes  \ij(m)  by  intersecting  tr.  and  Uj, 
and  thus  takes  ft(min( I m  I,  in}l))time.  By  Lemma  1,  we  spend  0(1  pi}(m)l)  time  to  establish  the 
value  of  pi  fan). 

2.  For  each  [mlt  •  •  • ,  m*]  in  domain  Bp  Chase’s  algorithm  computes  Bj{m\ ,  •  •  • ,  m*)  by 
evaluating  the  set  [f  (c  i ,  •  •  • ,  ck)  e  PF I  [c  i ,  •  •  • ,  c*]  e  m  i  x  •  •  •  x  m* )  naively  and  thus  takes 
£2(min(IPF  I,  l/nj  x  •••  xm*l))time.  In  our  algorithm  ,  the  initial  value  Bj{m  j ,  •  ■  ■ ,  m*)is  {v} 
by  default.  Then  it  gets  new  values  in  step  3  by  copying,  and  increases  one  element  at  a  time  in 
step  4.  Thus  we  spend  0(l8/(mj,  •  •  • ,  m*)l)  time  to  establish  the  value  of  Bj(m\,  •  •  • ,  m*). 
Usually  0y<m  j ,  •  •  • ,  mk)  is  much  smaller  than  either  PF  or  m  i  x  •  •  •  x  mk. 

3  and  4.  Follows  from  Lemma  1. 

We  briefly  mention  that  deleting  patterns  from  P  can  be  handled  much  like  pattern  addition, 
except  that  scheduling  pattern  deletion  from  PF  is  in  an  outermost-to-innermost  subexpression 
order.  Further,  a  pattern  is  deleted  from  PF  only  if  its  parent  is  not  in  PF.  The  deletion  algorithm 
follows  the  same  logic  as  the  addition  algorithm  but  in  a  backwards  order  to  undo  the  effect  of 
addition.  Details  will  be  provided  in  a  fuller  version  of  the  paper. 


4.  Space/Time  tradeoff 

In  Chase’s  algorithm,  for  each  function  symbol  f  e  F  of  arity  k,  the  space  required  for  the  8 / 
table  could  be  Cl(2lk).  Here  we  give  a  method  that  decomposes  8/  into  p  tables  with  worst  case 
overall  space  O  (p  ( 2ltUp ))  but  leads  to  time  0(p)  to  solve  the  Basic  Bottom -Up  Step. 

Let  PF  be  partitioned  into  p  disjoint  equal  size  sets  PF\ . PFp,  and  consider  equations, 

njrj  =  {c,: / (C| ,  •  •  •  ,Ct)  €  PFj} 
pi}.;  =  {[m,  m  n  n)  y]:  w  e  T) 

6/,y  =  {[[«! . md.  m]:mxe  range  pt}j,  •  •  • ,  m*  e  range  pi},; } 

where  m  =  {f(c j,  •  •  •  ,c*)  €  PF }  lc,e  m,,i=l . it)  u  (v) 

If  MS,  =  .....  pi},, (MS  (r*))),  then  we  can  compute  disjoint  unions  MS(/(r, . tk)) 

=  MS,u  •••uMSpinO(p)time. 

Consider  the  space  required  by  this  approach.  If  r},;  =  I  range  pi},;  I ,  then  r'jj  =  0(  2tUfJ')  = 
0(2'PFj')  =  O (2llp),  and  18/,;!  =  0 (r}jK..xr* j)  =  O (2u/p).  Thus,  the  total  space  storing  the  p 
match  tables  for  function  symbol  /  is  0(p( 2tk/p)),  which  for  p  >  1  is  asymptotically  better  than 
Chase’s  algorithm  in  the  worst  case. 


The  space  required  by  each  pi  table  is  always  ITI.  Thus  the  total  space  for  the  tables  pi},;, 
i=l.. L,  j  -  \.p  is  now  pk  1  Tl ,  and  the  total  space  for  the  pi  and  0  tables  for  function  symbol  /  is 

Ik 

0(pk\r\+p(2lkJp)).  When  p  =  - — —  ,  - ,  we  obtain  the  approximate  minimum 

log(xiri) 

Ik2 

o ( — — — iri). 

^iog(*iri)  h 


To  further  reduce  the  size  of  pi  tables,  we  can  split  each  pi},;  table  into  p  subtables  pi},;, 
r  =  l.p,  with  domain  pi},;  =  {  xnPF,:x  €  T).  Then  for  xeT,  we  have  pi },;(x)  = 
pi}  j(x  nPF  j )  u  ••  •  u  nfyx  n  PFp)},  which  can  be  computed  in  p  time.  This  increases  the  time 
per  step  to  0(p2). 
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The  total  size  of  the  p  subtables  is  now  bounded  by  0(p2llp),  and  the  total  space  for  the  ji. 
and  6  tables  for  function  /is  0(kp22tlp  +p2IkJp).  Since  this  approach  is  meaningful  only  for  step 
time  complexities  better  than  O  (/),  i.e.,  p  =  0  (/1/2),  the  best  upper  bound  we  can  get  in  this  case  is 
0(lil2 2cU  )  for  some  constant  c.  This  result  also  indicates  that  this  approach  is  useful  only 
when  in  »2/12. 

In  a  practical  implementation  it  is  not  necessary  for  PF  to  be  partitioned  into  disjoint  equal 
size  subsets.  For  example,  we  can  let  PF\  be  the  set  of  patterns  that  are  not  children  of  any  pat¬ 
tern,  PFi  be  the  set  of  children  of  patterns  in  PF,_i  not  contained  in  PFj,  where 
i  =  1  .maximum  height  of  patterns,  j  <  i.  Then  the  tables  n‘/j  can  be  omitted  for  t  >  j- 1.  Alterna¬ 
tively,  we  can  let  PF ,  be  the  set  of  all  children  of  patterns  in  PF,_ j .  Now  the  size  of  each  subset 
may  grow,  but  the  tables  can  be  omitted  for  all  t  *  j -l .  It  is  an  interesting  question  how  to 
find  a  partition  of  PF  that  minimizes  the  table  size  for  a  fixed  per  step  time  bound. 

5.  Match  set  elimination 

Hoffmann  and  O’Donnell  [10]  considered  two  subclasses  of  patterns  for  which  the  prepro¬ 
cessing  and  space  costs  for  bottom-up  multi-pattern  matching  are  greatly  reduced. 

Definition:  A  set  P  of  patterns  is  Simple  if  for  every  two  distinct  patterns  p,  q  e  PF,  either 
(1)  p  <  q,  (2)  q  <  p,  or  (3)  3  subject  t  I  /  £  q  and  t&p. 

For  Simple  Patterns  P  Hoffmann  and  O’Donnell  observed  that  the  partial  ordering  (PF,  <) 
could  be  represented  by  a  directed  tree  (called  a  subsumption  tree)  with  v  at  the  root  (assuming  that 
v  occurs  in  P).  Each  match  set  equals  the  set  of  patterns  along  some  path  in  the  subsumption  tree 
from  a  node  to  the  root.  And  every  path  from  a  node  to  the  root  determines  a  match  set  Thus, 
there  are  only  /  match  sets,  and  each  one  can  be  represented  by  its  minimum  partem.  For  a  func¬ 
tion  / of  arity  k,  the  transition  table  t/  uses  0(lk)  space,  a  great  improvement  over  the  general  case 
but  still  expensive.  Hoffmann  and  O’Donnell  also  argue  that  most  sets  of  patterns  they  have 
encountered  in  rewriting  systems  are  Simple  or  can  be  turned  into  equivalent  Simple  sets. 

Hoffmann  and  O’Donnell  also  looked  at  a  subclass  of  binary  Simple  Patterns;  i.e..  Simple 
Patterns  in  which  the  maximum  arity  of  any  function  is  two.  Although  greatly  restricted,  this  class 
is  interesting,  because  conventional  arithmetic  and  operations  in  combinatory  logic  have  arity  less 
than  or  equal  to  two.  Also,  Hoffmann  and  O'Donnell  showed  that  naive  transformation  of  patterns 
with  arity  greater  than  two  into  binary  form  sometimes  but  not  always  preserves  the  Simple  Partem 
property.  For  binary  Simple  Patterns  they  gave  an  algorithm  requiring  no  transition  tables,  but 
uses  0(l2)  space,  0(lh2)  preprocessing  time  (h  is  the  longest  path  in  the  subsumption  tree),  and 
0(h2)  time  instead  of  an  0(1)  time  for  Step  (1). 

We  will  give  a  bottom -up  algorithm  for  binary  Simple  Patterns  (which  extends  to  a  subclass 
of  Simple  Patterns  with  arbitrary  arity)  with  0(1)  space  and  0(log  l)  time  per  step.  Our  Prepro¬ 
cessing  time  is  the  same  as  that  of  Hoffmann  and  O’Donnell.  The  algorithm  makes  use  of  persis¬ 
tant  search  trees  [20],  and  we  expect  it  to  be  fast  in  practice. 

Let  PF  be  the  pattern  forest  for  the  set  P  of  patterns,  and  let  T  be  its  subsumption  tree.  Recall 
that  for  Simple  Patterns  each  match  set  can  be  represented  by  the  unique  minimum  pattern  in  the 
set  If  pi  represents  the  match  set  for  subpattem  t,  of  the  subject,  t  =  1 ..  k,  then  the  match  set  for 
f(t\,  •  *  •  ,h)  is  represented  by  the  pattern  determined  by  the  following  formula: 

(New  Bottom -Up  Step): 

(3)  min  /  ((v)  <J  (f(q\,  •  ■  • ,  q*)  e  PF  I  2p„  i  =  1 ..  k }) 

We  call  pattern  f(p\,  •  •  • ,  pk)  the  search  argument  for  Step  (3). 
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Consider  any  binary  function  / appearing  in  PF,  and  let/(pj,  p2)  be  the  search  argument  for 
Step  (3).  (We  will  not  discuss  unary  patterns  and  constants,  which  are  simpler  subcases.)  We  want 
to  analyze  (i)  the  worst  case  cost  of  performing  Step  (3);  and  (ii)  the  auxiliary  space  while  execut¬ 
ing  Step  (3). 

An  important  observation  is  that,  unlike  patterns  p  i  and  p2,  search  argument  f(p  j ,  p2)  may 
not  belong  to  the  subsumption  tree  T!  Consequently,  if  we  let  1  denote  the  unique  maximum  pat¬ 
tern,  and  if  we  define  relation  R  =  {[x,  y]:/(x,  y)  e  PF)  u  ([1,1]),  then  we  can  replace  Step  (3) 
for  search  argument  f  (p\,  p2)  more  conveniently  by, 

(4)  min  /  {[x,  y]  6  R  I  x  £pi  andy  Zp2] 

Expression  (4)  can  be  computed  by  locating  the  pair  of  nearest  ancestors  belonging  to  R  of 
nodes  p\  and  p2  with  respect  to  subsumption  tree  T.  This  characterization  is  meaningful  because 
of  the  following  proposition. 

Proposition:  If  [xj,  yj]  and  [X2,  y2]  are  any  two  pairs  in#  and  Xi  <X2,theny2  ^yj. 

Proof  Otherwise,  P  would  not  be  Simple:  i.e.,  we  would  have  f(x\,  y2)  <  f(x\,  yi)  and 
f(xuy2)<f(x2,y2). 

In  order  to  compute  (4)  efficiently,  the  difficulties  of  two  dimensional  ancestor  testing  and 
searching  within  partially  ordered  sets  need  to  be  overcome.  This  is  done  by  reducing  the  two 
dimensional  nearest  ancestor  search  in  tree  T  to  single  dimensional  searching  through  a  totally 
ordered  set.  The  essential  idea  is  presented  just  below. 

Let  #{x]  denote  the  set  (y :  [x,  y]  e  #},  and  let  domain  R  denote  the  set  (x:[x,y]e  R).  For 
each  xe  domain  R,  define  set  S (x)  =  uy2x/?{y);  for  each  z  e  S  (x)  define  witness 

w(x,  z)  =  minimum  y  £  x  such  that  [y,  z )  e  R 
Then  we  can  compute  (4)  by  performing  these  two  queries: 

(5)  i.  (?o  =  n>in /{xe  domain^  I  xipj) 
ii.  q2  =  min/(ye  S(q0)  I  y  Zp2) 

If  either  qQ  orq2  equals  1,  then  v  is  the  answer,  otherwise,  we  obtain /(w(^0.  q2)>  qi)- 

The  two  queries  (S)  reduce  computation  (4)  to  finding  single  dimensional  nearest  ancestors 
and  computing  and  storing  sets  S(x).  Nearest  ancestors  in  trees  can  be  computed  efficiently  based 
on  the  following  idea.  Let  pre  ( i )  and  des  (i)  be  the  preorder  number  and  descendent  count  of  node  i 
in  tree  T.  Then  node  i  is  an  ancestor  of  node  j  iff  pre  (i)  S  pre  ( j )  <  pre  (i)  +  des  (i);  also,  if  i  and  k 
are  both  ancestors  of  j,  then  i  is  nearer  to  j  than  k  iff  pre  (j)  >  pre  (k). 

Let  Q  be  any  subset  of  the  nodes  in  T.  Then  for  any  node  p  in  T,  we  can  compute 

(6)  min  /[xe  Q  I  x  tp) 

whenever  a  solution  exists  by  finding  the  node  i  in  Q  with  maximum  pre(i)  such  that 
pre(i)  £ pre(p)  <  pre(i)  +  des(i).  To  facilitate  this  computation  we  can  preprocess  Q  as  follows. 
For  all  /  in  Q  define  function  find  (pre  (i))  =  i  and  find  ( pre  (i)  +  des  (/)) = j  such  that  pre  (J)  is  the 
maximum  for  which  pre  (J)  S  pre  (i)  +  des  (i)  <  pre  (J)  +  des  (J)  and  j  e  Q.  Hence,  (6)  can  be 
solved  by  computing  find(x),  where  x  is  the  greatest  element  in  domain  find  such  that  x  Zpreip). 

We  can  store  domain  find  as  either  a  red/black  tree  [8,23]  or  Willard’s  variant  of  the  Van 
Emde  Boas  priority  queue[24,25]  and  obtain  the  following  time/space  bounds.  Both  data  struc¬ 
tures  use  space  0(\Q\).  Computing  query  (6)  costs  0 (log I <2 ')  red/black  trees,  and 
0(loglog/)  with  priority  queues  (where  /  is  the  number  of  nodes  in  T). 
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Based  on  the  preceding  analysis,  we  can  perform  query  (5),  (i)  with  0(1)  space  overall  if  we 
store  all  of  the  domains  of  relations  R  for  each  binary  function  /  appearing  in  T  either  as  red/black 
trees  or  Van  Emde  Boas  priority  queues.  Query  time  is  0(\ogl)  using  red/black  trees,  O  (loglog/) 
with  priority  queues. 

For  query  (5),  (ii)  we  can  store  all  of  the  sets  S  (go)  and  their  witnesses  using  a  minor  variant 
of  the  persistent  search  tree  of  Samak  and  Tarjan  [20].  Recall  that  a  persistent  search  tree  can  store 
a  sequence  To,  T\,  •  •  • ,  Tr  of  sets,  where  To  is  empty  and  T;  is  formed  from  T,_j  by  element 
addition  or  deletion  for  i  =  1,  •  •  • ,  r.  The  data  structure  takes  up  0(r)  space  and  can  support  the 
nearest  neighbor  operation  pred(i,  x)  =  max  /  [ye  T,  I  y  £  x)  in  O  (log  r)  worst  case  time. 

In  our  application  the  sequence  of  sets  is  obtained  by  traversing  the  subsumption  tree  T  in 
preorder,  adding  as  we  arrive  at  node  x  from  its  parent,  and  deleting  /?{*}  when  we  go  back 
from  x  to  its  parent  Hence,  the  sets  S  (x)  for  x  in  domain  R  are  included  as  a  subsequence. 
Witnesses  are  stored  using  stacks  inside  the  search  tree.  Since  each  set  R[x]  is  added  and  deleted 
once  in  forming  the  sequence,  the  size  r  of  our  sequence  is  just  IR  I ,  which  is  also  the  number  of 
distinct  patterns  with  root  /  appearing  in  PF.  Thus,  query  (5),  (ii)  can  be  computed  in  O  (log/) 
time,  and  the  cumulative  space  for  storing  persistent  search  trees  for  all  the  binary  functions  / 
appearing  in  PF  is  just  O  (/).  Thus,  we  have 

THEOREM  2.  Step  (3)  can  be  computed  for  binary  Simple  Patterns  in  OQogl)  time  and  0(1) 
space. 

Extending  the  preceding  idea  to  functions  of  arbitrary  arity  is  straightforward. 

Definition:  A  k-ary  function  symbol  /  is  Very  Simple  if  there  exists  a  ^-permutation  g  such 
that  for  i'=l,...,*-l  and  every  two  distinct  /  patterns  f(xi,...jck)  and  fiy\,...,yk),  xtjZytj  y=l 
implies  *,4>1  *ytM. 

Any  Very  Simple  function  /  in  a  Simple  pattern  forest  can  be  handled  without  a  transition 
map.  Our  algorithm  runs  in  step  time  O(kmox  log  l)  and  total  auxiliary  space  0(kmax  l)  for  all 
Simple  functions  together,  where  kmax  is  the  greatest  arity  of  any  Very  Simple  function  appearing 
in  PF. 

6.  Conclusion 

We  believe  that  a  deeper  analysis  and  exploitation  of  the  structure  of  pattern  matching  can 
lead  to  further  algorithmic  improvements.  In  a  subsequent  paper  we  will  report  how  to  extend  the 
algorithms  presented  here  to  a  more  complex  pattern  language,  which  is  used  to  perform  semantic 
analysis  within  RAFTS. 

Acknowledgements  We  are  grateful  for  stimulating  discussions  about  pattern  matching  with 
David  Chase,  Chris  Hoffmann,  and  Ken  Perry.  We  also  thank  the  CAAP  referees  for  helpful  com¬ 
ments. 

References 

1.  Aho.  A.,  Hopcroft,  J„  and  Ulbnan,  J„  Design  and  Analysis  of  Computer  Algorithms,  Addison-Wesley,  1974. 

2.  Bonder,  J„  Monclce,  U„  and  Wilhelm,  R„  Table  Compression  for  Tree  Automata,  Lehntuhl  fur  Inform  atik  III, 
Universitai  des  Saarlander,  1987. 

3.  Burgh ardt,  J„  “A  Tree  Pattern  Matching  Algorithm  with  Reasonable  Space  Requirements,’'  in  Proc.  CAAP  ’ 88 , 
ed.  M.  Daudet  and  M.  Nivat,  Lecture  Notes  in  Computer  Science,  vol.  299,  pp.  1-15,  Springer-Verlag,  1988. 

4.  Cai,  J.  and  Paige,  R.,  "The  RAPTS  Transformational  System  -  A  Proposal  For  Demonstration,"  in  ESOP  '90  Sys¬ 
tems  Exhibition,  May  1990. 


-15- 


5.  Chase,  D.,  “An  improvement  to  bottom-up  nee  pattern  matching,*’  in  Proc.  Fourteenth  Annual  ACM  Symposium 
on  Principles  of  Programming  Languages,  pp.  168-177,  January,  1987. 

6.  Donzeau-Gouge,  V.,  Huet,  G,,  Kahn,  G.,  and  Lang,  B.,  “Programming  environments  based  on  structured  Editors: 
die  Mentor  Experience,"  in  Interactive  Programming  Environments ,  ed.  D.  Bars  tow,  H.  Shrobe,  and  E.  San- 
dewall,  McGraw-Hill,  1984. 

7.  GivJer,  J.  and  Kieburtz,  R.,  "Schema  Recognition  for  Program  Transformations,”  in  ACM  Symposium  on  USP 
and  Functional  Programming,  pp.  74-85,  Aug,  1984. 

8.  Guibas,  L  and  Sedgewick,  R.,  “A  dichromatic  framework  for  balanced  trees,”  in  Proc.  19th  IEEE  FOCS,  pp. 
157-184. 1978. 

9.  Hatcher,  P.  and  Christopher,  T.,  “High-Quality  Code  Generation  Via  Bottom-Up  Tree  Pattern  Matching,”  in 
Proceedings  13th  ACM  Symposium  on  Principles  of  Programming  Languages,  pp.  119-130,  Jan,  1986. 

10.  Hoffmann,  C.  and  O'Donnell,  J„  “Pattern  Matching  in  Trees,”  JACM,  vol.  29,  no.  1,  pp.  68-95,  Jan,  1982. 

1 1.  Hoffmann,  C.  and  O'Donnell,  M.,  “Programming  with  Equations,"  ACM  TOPLAS,  vol.  4,  no.  1,  pp.  83-1 12,  Jan., 

1982. 

12.  Hudak,  P„  “Conception.  Evolution,  and  Application  of  Functional  Programming  Languages,"  ACM  Computing 
Survey,  vol.  21,  no.  3,  pp.  359-411,  Sep.  1989. 

13.  Huet,  G.  and  Lang,  B„  "Proving  and  Applying  Program  Transformations  Expressed  with  Second-Order  Pat¬ 
terns,"  Aero  Information,  vol.  1 1.  pp.  31-55,  1978. 

14.  Knuth,  D.  and  Bendix,  P.,  “Simple  Word  Problems  in  Universal  Algebras,”  in  Computational  Problems  in 
Abstract  Algebra,  ed.  Leech.  J.,  pp.  263-297,  Pergamon  Press,  1970. 

15.  Kosaraju,  S.,  "Efficient  Tree  Pattern  Matching,”  in  Proc.  FOCS  '89,  Oct,  1989. 

16.  Maluszynsld,  J.  and  Komorowsld,  H.  J.,  “Unification-free  execution  of  logic  programs,”  IEEE  Proceedings  of 
symposium  on  logic  programming,  Boston,  1985. 

17.  Pelegri-Llopart,  E.  and  Graham,  S.,  “Optimal  Code  Generation  for  Expression  Trees:  An  Application  of  BURS 
Theory."  in  Proceedings  15th  ACM  Symposium  on  Principles  of  Programming  Languages,  pp.  294-308,  Jan, 
1988. 

18.  Pfenning,  F.  and  Elliott,  C..  "Higher -Order  Abstract  Syntax,”  in  Proceedings  SIGPLAN  ‘88  Corf,  on  Prog.  Lang. 
Design  and  Implementation,  pp.  199-208,  June,  1988. 

19.  Purdom,  P.  and  Brown,  C-  “Fast  Many-to-one  Matching  Algorithm."  in  Proc.  RTA  '85,  ed.  J.-P.  Jouannaud,  Lec¬ 
ture  Notes  in  Computer  Science,  vol.  202,  pp.  407-416,  Springer-Verlag,  1985. 

20.  Samak,  N.  and  Taijan,  R-,  “Planar  Point  Location  Using  Persistent  Search  Trees,"  CACM,  vol.  29,  no.  7,  pp. 
669-679,  July,  1986. 

21.  Sethi.  R.,  Programming  Languages:  Concepts  and  Constructs,  Addison- Wesley,  1989. 

22.  Standish,  T.,  Kibler,  D.,  and  Neighbors,  J*  “The  Irvine  Program  Transformation  Catalogue,”  Univ.  of  Cal.  at 
Irvine.  Dept,  of  Information  and  Computer  Science,  Jan,  1976. 

23.  Tarjan,  R.,  Data  Structures  and  Network.  Algorithms,  SIAM,  1984. 

24.  Van  Emde  Boas,  P„  "Preserving  Order  in  a  Forest  in  Less  Than  Logarithmic  Time  and  Linear  Space,”  I  PL,  vol. 
6.  pp.  80-82, 1977. 

25.  Willard,  D.,  “Log-Logarithmic  Worst-Case  Range  Queries  are  Possible  in  Space  0(N).“  IPL,  vol  17,  pp.  81-89, 

1983. 


